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Power-free values, large deviations, 
and integer points on irrational curves 

par H. A. HELFGOTT 

Resume. Soit / e Z[a;] un polynome de degre d > 3 sans racines 
de multiplicite d ou (d — 1). Erdos a conjecture que si / satisfait 
les conditions locales necessaires alors f{p) est sans facteurs puis- 
sances (d — 1)''™°'' pour une infinite de nombres premiers p. On 
prouve cela pour toutes les fonctions / dont I'entropie est assez 
grande. 

On utilise dans la preuve un principe de repulsion pour les 
points entiers sur les courbes de genre positif et un analogue 
arithmetique du theoreme de Sanov issu de la theorie des grandes 
deviations. 

Abstract. Let / e Z[x] be a polynomial of degree d > 3 without 
roots of multiplicity d or (d — 1). Erdos conjectured that, if / 
satisfies the necessary local conditions, then f{p) is free of (d— l)th 
powers for infinitely many primes p. This is proved here for all / 
with sufficiently high entropy. 

The proof serves to demonstrate two innovations: a strong re- 
pulsion principle for integer points on curves of positive genus, 
and a number-theoretical analogue of Sanov's theorem from the 
theory of large deviations. 

1. Introduction 

1.1. Power-free values of f{p), p prime. Let / G 7j[x] be a polynomial 
of degree d > 3 without roots of multiplicity k or greater. It is natural to 
venture that there are infinitely many integers n such that f{n) is free of 
kth. powers, unless local conditions fail. (An integer a is said to be free of 
kth powers if there is no integer b > 1 such that ^'^lo.) In fact, such a guess 
is not only natural, but necessary in many applications; for example, we 
need it to hold with A: = 2 if we want to approximate the conductor of an 
elliptic curve in a family in terms of its discriminant (see [21] and §5, 
for two contexts in which such an approximation is crucial). 

Assume an obviously necessary local condition - namely, that f{x) ^ 
modp'^ has a solution in Z/p^Z for every prime p. If k > d, it is easy 
to prove that there are infinitely many integers n such that f{n) is free of 
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kth powers, li k < d — 1, proving as much is a hard and by-and-large open 
problem. (See [36], [29] and [20j for results for d large.) Erdos proved that 
there are infinitely many n such that f{n) is free of kth. powers for k = d—1. 
Furthermore, he conjectured that there are infinitely many primes q such 
that f{q) is free of {d — l)th powers, provided that f{x) ^ modp'^~^ 
has a solution in (Z/p'^^^Z)* for every prime p. This conjecture is needed 
for applications in which certain variables are restricted to run over the 
primes. Erdos's motivation, however, may have been the following: there 
is a difficult diophantine problem implicit in questions on power-free values 
- namely, that of estimating the number of integer points on twists of a 
fixed curve of positive genus. Erdos had managed to avoid this problem 
for k = d — 1 and unrestricted integer argument n; if the argument n is 
restricted to be a prime q, the problem is unavoidable, and must be solved. 

The present paper proves Erdos's conjecture for all / with sufficiently 
high entropy. As we will see, even giving a bound of 0(1) for the dio- 
phantine problem mentioned above would not be enough; we must mix 
sharpened diophantine methods with probabilistic techniques. 

We define the entrop^ If of an irreducible polynomial / over Q to be 

(1-1) ^f = T^\ ^ 

where Galj is the Galois group of the splitting field of / and is the 
number of roots of / fixed hy g £ Gal/. (We write \S\ for the number of 
elements of a set S.) 

Theorem 1.1. Let / G Z[x] be a polynomial of degree d without roots of 
multiplicity > k, where k = d — 1 and d > 3. If f is irreducible, assume 
that its entropy If is greater than 1. Then, for a randon^ prime q, the 
probability that f{q) be free of kth powers is 

M n(i-^). 

where Pf^^^ip'^) stands for the number of solutions to f{x) = modp'^ in 

{z/p^zy. 



This is essentially a relative entropy, appearing as in the theory of large deviations; vd. jjS] 

o 

Let S be an infinite set of positive integers - in this case, the primes. When we say that the 
probability that a random element q of S satisfy a property P is a;, we mean that the following 
limit exists and equals x: 

\{1 < q < N : q S satisfies P}\ 

lim 

iV-,oo III < g < N : g e S}\ 
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Remark. The probability (jl.2p is exactly what one would expect from 
heuristics: the likelihood that a random prime q be indivisible by a fixed 

prime power is precisely 1 — '^I'^pk-i ■ The problem is that we will have 
to work with a set of prime powers whose size and number depend on q. 

It is easy to give a criterion for the non- vanishing of ()1.2p . 

Corollary 1.2. Let f G be a polynomial of degree d without roots of 
multiplicity > k, where k = d — 1 and d > 3. If f is irreducible, assume 
that its entropy If is greater than 1. Assume as well that no kth power 
, m > I, divides all coefficients of f , and that f{x) ^ modp^ has a 
solution in {'L/p^'L)* for every p < d+1. Then f{p) is free of kth powers for 
infinitely many primes p. Indeed, f[p) is free of kth powers for a positive 
proportion of all primes. 

Remark. An irreducible polynomial / of degree 3, 4, 5 or 6 has entropy 
greater than 1 if and only if its Galois group is one of the following: 

^3,C(4),i5;(4),Z?(4),C7(5), 

^ ■ ' C{G),Dg{Q),D{G),A^{6),Fis{6),2A^{6),Fi8{6) : 2, ^36(6), 2^4(6), 

in the nomenclature of [6]. See Table [H Erdos's problem remains open for 
irreducible polynomials with the following Galois groups: 

S3,A4,S4,D{5),F{5),A5,S5, 

54(6d),54(6c),L(6),F36(6) : 2, L(6) : 2, ^6,^6. 

Remark. We will be able to give bounds on the rate of convergence to 
()1.2p : the proportion of primes q < N such that f{q) is free of kth powers 
equals (|1.2p +0((log A^)""^). 7 > 0. We will compute 7 explicitly in ^ In 
particular, if d = 3 and Gal/ = ^3, then 7 = 0.003567 .... See ^ Tabled 

Remark. The entropy If is greater than 1 for every normal polynomial 
/ of degree > 3. (A polynomial is normal if one of its roots generates 
its splitting field.) In particular, If > 1 for every / with Galj abelian and 
deg(/) > 3. We do have If > 1 for many non-normal polynomials / as well; 
most of the groups in (jl.3p are Galois groups of non-normal polynomials. 
In contrast, for / of degree d with Galj = 5"^, the entropy If tends to 
Y.'k=2 i^rf^ = 0.5734028 ... as d ^ 00. See 

Remark. If we can tell whether or not f,g£ take values free of fcth 
powers for infinitely many prime arguments, we can tell the same for / • g. 
In other words, when we work with a reducible polynomial, the degree and 
entropy of the largest irreducible factors of the polynomial matter, rather 
than the degree of the polynomial itself. We will take this fact into account 
in the statement of the main theorem. 
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D{4) 
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C{b) 
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F(5) 
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^5 
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C(6) 


1.7917595 


A (6) 


1.7917595 


0(6) 


1.2424533 


^4(6) 


1.2424533 


i^l8(6) 


1.3296613 


2^4(6) 


1.3143738 


Si{<dd) 


0.9678003 


54 (6c) 


0.9678003 


i^i8(6) : 2 


1.0114043 


i^36(6) 


1.0114043 


254(6) 


1.0037605 


m 


0.5257495 


i^36(6) : 2 


0.9678003 


L(6) : 2 


0.6094484 




0.5693535 


5*6 


0.5734881 











Table 1. Entropies of irreducible polynomials of degree 3, 4, 5, 6 



1.2. General statement. Theorem 11.11 holds over many sequences other 
than the primes. All we use about the primes is that the proportion of 
them lying in a given congruence class can be ascertained, and that they 
are not much sparser than a simple sieve majorisation already forces them 
to be. 

Definition 1.1. Let 5 be a set of positive integers. We say that 5 is 
predictable if the limit 

, / X . \in £ S : n < N,n = a modm}\ 

(1.4) p(a,m) = hm — ^— ^ — - 

^ ' > y \{ne S :n< N}\ 

exists for all integers a, m > 0. 

The following definition is standard. 

Definition 1.2. Let P be a set of primes. We say that P is a sieving set 
of dimension 9 if 

(1.5) n i^--Y«Ci^ 

w<p<z ^ ^ " 

for all w, z with z > w > 1, where > is fixed. 

We are about to define tight sets. A tight set is essentially a set whose 
cardinality can be estimated by sieves up to a constant factor. 

Definition 1.3. Let 5 be a set of positive integers. Let P be a sieving set 
with dimension 9. We say that 5 is (P, 9) -tight if (a) no element n of 5 is 
divisible by any prime in P smaller than n^, where 5 > is fixed, (b) the 
number of elements of {n G 5 : n < N} is ^ N/{\ogNy for X sufficiently 
large. 
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In other words, 5 is a (P, 0)-tight set if the upper bounds on its density 
given by its sieve dimension 6 are tight up to a constant factor. 

Main Theorem. Let S be a predictable, {P,6)-tight set. Let k > 2. Let 

f G Z[x] be a polynomial such that, for every irreducible factor g of f , the 
degree of g is < kg + 1, where kg = [A;/rg] and rg is the highest power of g 
dividing f . Assume that the entropy Lg of g is > (kg + 1)9 — kg for every 
irreducible factor g of f of degree exactly kg -\- 1. 

Then, for a random element q of S, the probability that f{q) be free of 
kth powers is 

(1.6) hm V ^(m) V p{a,m''), 

z^oo ^ ' ^ ' 

m>l 0<a<m* 

p\m^p<z /{a)=0 modm*: 

where p{a, m) is as in 

The expression whose hmit is taken in (jl.Op is non-negative and non- 
increasing on z, and thus the hmit exists. 

Example. The primes are, of course, predictable and (P, l)-tight, where 
P is the set of all primes. Thus, Thm. 11.11 is a special case of the main 
theorem. In the general case, if the convergence of ()1.4p is not too slow, we 
can obtain bounds for the error term that are of the same quality as those 
we can give in the case of the primes, viz., upper bounds equal to the main 
term times (1 + 0((log iV)"'^)), 7 > 0. 

Example. Let S be the set of all sums of two squares. Then S is predictable 
and (P, ^)-tight, where P is the set of all primes p = 3 mod 4. Since the 
entropy (jl.ip of a polynomial is always positive, we have Lg > d-^ — (d—l) for 
every irreducible g of degree d> 1, and thus we obtain the asymptotic (jl.Op . 
For this choice of S, the techniques in ^ and SI suffice; the probabilistic 
work in f|5] is not needed. The same is true for any other S that is (P, 9) 
tight with 9 < 

Note that we are considering sums of two squares counted without mul- 
tiplicity. A statement similar to (jl.6p would in fact be true if such sums are 
counted with multiplicity; to prove as much is not any harder than to prove 
(|1.6p for 5 = Z, and thus could be done with classical sieve techniques. 

Example. The set of all integers is predictable and (P, 0)-tight, and thus 
the main theorem applies. We will discuss the error terms implicit in 
(|1.6p generally and in detail. Setting S = Z, and deg(/) = 3, we will 
obtain that the total number of integers n from 1 to such that /(n) is 
square-free equals iVnp(l-/0/(p)/p) plus O f{N (log N)-^/^) (if Gal/ = ^3) 
or O f{N (log N)^"^^^) (if Galj = 5*3). See Prop. 17.31 The error terms 
O f{N (log N)-^^^) and Of{N{\ogN)-'^/^) are smaller than those in [22], 
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Thm. 5.1 (respectively, 0/(iV(log A^)-°-S°6^-) and Of {N {log Nf-'^^'^^-)), 
which were, in turn, an improvement over the bound in ^26j, Ch. IV (namely, 
Of{N{logNy/'^)). Analogous improvements also hold for square-free val- 
ues of homogeneous sextic forms; here the strongest result in the literature 
so far was [22j, Thm. 5.2, preceded by the main theorem in jl6j. 

The main theorem would still hold if the definition of a (P, ^)-tight set 
were generalised somewhat. There is no reason why the sieved-out congru- 
ence class modulo p, p £ P, should always be the class a = mod p. One 
must, however, ensure that, for every factor g oi f with deg[g) > 1, we 
get g{a) ^ modp for all but finitely many of the sieved-out congruence 
classes a modp, or at any rate for all p E P outside a set of low density. 
One may sieve out more than one congruence class per modulus p £ P. 
The number of sieved-out congruence classes per p £ P need not even be 
bounded by a constant, but it ought to be constrained to grow slowly. 

1.3. Plan of attack. Estimating the number of primes p for which f{p) 
is not free of kth powers is the same as estimating the number of solutions 
{t, y, x) to ty^ = f{x) with x prime, x, t, y integers, y > 1, and x, t, y within 
certain ranges. The solutions to ty^ = f{x) with y small (or y divisible by a 
small prime) can be counted easily. What remains is to bound from above 
the number of solutions {t,y,x) to ty'^ = f{x) with x and y prime and y 
very large - larger than x(logx)~'', e > 0. It is intuitively clear (and a 
consequence of the abc conjecture; see [15]) that such solutions should be 
very rare. Bounding them at all non-trivially (and unconditionally) is a 
different matter, and the subject of this paper. 

Counting integer points on curves. Let C be a curve of positive genus g. 
Embed C into its Jacobian J. The abelian group J(Q) is finitely generated; 
call its rank r. Map the lattice of rational points of J to MJ" in such a way as 
to send the canonical height to the square of the Euclidean norm. Project 
M'" \{0} radially onto the sphere S^~^. Let Pi, P2 be two rational points on 
C whose difference in J is non-torsion. Mumford's gap principle amounts 
in essence to the following statement: if Pi and P2 are of roughly the same 
height, then the images of Pi and P2 on S^~^ are separated by an angle of 
at least arccos |. This separation is not enough for our purposes. We will 
show that, if Pi and P2 are integral and of roughly the same height, then 
their images on S^~^ are separated by an angle of at least arccos 

The case g = 1 was already treated in [22], §4.7. The separation of the 
points is increased further when, in addition to being integral, Pi and P2 
are near each other in one or more localisations of C. This phenomenon 
was already noted in [23] for g = 1, as well as in the case of Pi, P2 rational 
and g > 1 arbitrary. 
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In section ^ we will use the angular separation between integer points 
to bound their number. This will be done by means of a lemma on sphere 
packings. In our particular problem, Pi and P2 may generally be taken to 
be near enough each other in sufficiently many localisations to bring their 
separation up to 90° — e. We will then have uniform bound^ of the form 
0((logt)'^) for the number of points on a typical fibre ty^ = f{x). 

Large deviations from the norm. Let p be a typical prime, i.e., a prime 
outside a set of relative density zero. Suppose that tq^ = f{p) for some 
prime q and some integer t < p(logp)^. We can then show that t is, in 
some ways, a typical integer, and, in other ways, an atypical one. (We first 
look at how large the prime factors of t are, and then at how many there 
are per splitting type.) The former fact ensures that the above-mentioned 
bound 0{{logtY) on the number of points on ty^ = f{x) does hold. The 
latter fact also works to our advantage: what is rare in the sense of being 
atypical must also be rare in the sense of being sparse. (The two senses are 
one and the same.) Thus the set of all t to be considered has cardinality 
much smaller than p(logpY. 

How much smaller? The answer depends on the entropy If of f. (Hence 
the requirement that If > 1 for Theorem 11.11 to hold.) Results on large 
deviations measure the unlikelihood of events far in the tails of probability 
distributions. We will prove a variant of a standard theorem (Sanov's; 
see [l3] or, e.g., [25j, §11.1) where a conditional entropy appears as an 
exponent. We will then translate the obtained result into a proposition in 
number theory, by means a slight refinement of the Erdos-Kac technique 
(|12j). (The refinement is needed because we must translate the far tails of 
the distribution, as opposed to the distribution itself.) 

Our bounds on the number of t's are good enough when they are better 
by a factor of (logX)*^ than the desired bound of X/{logX) on the total 
number of tuples {t, q,p) satisfying tq'^ = f{p); this is so because our upper 
bound on the number of points per t is in general low, viz., 0{(logtY). 

1.4. Relation to previous work. Using techniques from sieve theory 
and exponential sums, Hooley ([27J, [28j) proved Erdos's conjecture for 
polynomials / € Z[x] of degree deg(/) > 51; for / normal and in a certain 
sense generic, he softened the assumption to deg(/) > 40 ([28], Thms. 5, 6). 
(The results in the present paper apply to all normal polynomials /, as their 
entropy is always high enough; see the comments at the end of ^) Then 
came a remarkable advance by Nair [36] , who, using an approach ultimately 



Bounds such as Mumford's Ochi^og ho) ( |24) . Thm. B.6.5) for the number of L-rational 
points on C of canonical height up to ho would be insufficient: for C fixed and L variable, the 
implied constant is proportional to c'^^^'^^iJi^)) ^ where c > 1 is a fixed constant. The same is 
true of bounds resulting from the explicit version of Faltings' theorem in |3] — the bound is then 
Q^(-7rank(j(L)))^ Our bound is Oc({l + e)™'' J(L)) j^^, ^ typical t. (Here L = Q{t^/'=).) 
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derived from Halberstam and Roth's work on gaps between square-free 
numbers [19], showed that Erdos's conjecture holds whenever deg(/) > 7. 
No other cases of the conjecture have been covered since then. 

It is a characteristic common to the rather different approaches in |27j 
and [36] that Erdos's conjecture is harder to attack for deg(/) small than 
for deg(/) large. If one follows the approach in the present paper, it is not 
the degree deg(/) that is crucial, but the entropy Ij: the problem is harder 
when If is small than when If is large. 

There are results ([36j, [29], [20]) on values of f{n) and f{p) free of kth 
powers, where k = deg(/) — 2 or even lower, provided that deg(/) be rather 
high. This is an interesting situation in which our methods seem to be of 
no use. 

1.5. Acknowledgements. The author would like to thank Alina Cojo- 
caru, E. V. Flynn, Anant Godbole, Christopher Hall and Anatole Joffe 
for their patient assistance, Andrew Granville, for his assistance and en- 
couragement, and Christopher Hooley, for a dare. Thanks are also due to 
writers of free software (|14j) and to an anonymous referee. 

2. Notation 

2.1. Sets. We denote by IS"! the number of elements of a finite set S. As 
is usual, we say that \S\ is the cardinality of S. 

2.2. Primes. By p (or q, or qi, or (72) we shall always mean a prime. We 
write co{n) for the number of prime divisors of an integer n, and 7r(A^) for 
the number of primes from 1 up to N. Given two integers a, b, we write 
a\b°° if all prime divisors of a also divide 6, and a f 6°° if there is some 
prime divisor of a that does not divide b. We define gcd(a, 6°°) to be the 
largest positive integer divisor of a all of whose prime factors divide b. 

2.3. Number fields. Let K he a number field. We write K for an al- 
gebraic closure of K. Let Mk be the set of places of K. We denote the 
completion of -fC at a place v G Mk by Ky. If / G Q[x] is an irreducible 
polynomial, let Galj be the Galois group of the splitting field of /. 

If p is a prime ideal of K, we denote the place corresponding to p by Vp. 
Given x £ K* , we define Vp{x) to be the largest integer n such that x £ p"". 
Define absolute values | • {^^ on Q by = p~^p^^\ If li; is a place of K, 
and Vp is the place of Q under it, then | • |to is normalised so that it equals 
\ • \vp when restricted to Q. 

Given a positive integer n and a conjugacy class {g) in Gal(-fir/Q), we 
write uji^g) (n) for Ep\n,p unramified,Frobp=(g> 1> ^here Frobp denotes the FYobe- 
nius element of p in K/Q. 
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2.4. Curves. As is usual, we denote local heights with respect to a divisor 
D by Xd,v, and the global height hy ho- Let C be a curve over a local field 
Kyj, and let i? be a point on C. We then say that a point P on C is integral 
with respect to (R) if f{P) is in the integer ring of Kyj for every rational 
function f on C without poles outside R. Given a curve C over a number 
field K, a set of places S including all archimedean places, and a point R 
on C, wc say that a point P on C is S -integral with respect to (R) if P is 
integral on C (g) Kyj with respect to (R) for every place w ^ Mk \ S. 

2.5. Functions. We will write exp(x) for e^. We define li{N) = 

2.6. Probabilities. We denote by ¥{E) the probability that an event E 
takes place. 



3. Repulsion among integer points on curves 

Consider a complete non-singular curve C of genus g > 1 over a number 
field K. Embed C in its Jacobian J by means of the map P ^ C1(P) — (Pq), 
where Pq is a fixed arbitrary point on C. Let (•, •) : J{K) x J{K) M, 
I • I : J{K) — >■ M be the inner product and norm induced by the canonical 
height corresponding to the theta divisor 9 G Div(J). Denote by A the 
diagonal divisor on C x C. 

Theorem 3.1. Let K be a number field. We are given a complete non- 
singular curve C/K of genus g > 1 with an embedding P i-^ C1(P) — (Pq) 
into its Jacobian J{C). Let R be a point on C, and let S be any set of 
places of K including all archimedean places. Let L/K be an extension of 
degree d; write Sl for the sets of places of L above S. 

Then, for any two distinct points P,Q E C{L) that are SL-integral with 
respect to (R), 

(P, Q) < ^(|PP + IQP) - ^ max(|Pp, |g|2) 

1 1 

+ 2*^ ~ 2 ^ max(AA,tt,(P, Q), 0) + Oc,K,e,d,R,Po{^) 
weMi\SL 

for every e > 0, where 
(3.2) 

^= XI ^«'(™ax(A(^)^^(P),A(R)_^(g)) -min(A(K)_^(P),A(R),^(g))) 
weSL 

and dw = [Lw : Qp]/[L : Q], where p is the rational prime lying under w. 

The fact that the error term Oc,K,e,d,R,Po{^) does not depend on L will 
be crucial to our purposes. 
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Proof. We may state Mumford's gap principle as follows: 

(3.3) 2g{P, Q) < (1 + + \Q\^) - gh^{P, Q) + Oc,pW^). 

(See, e.g., [33], Thm. 5.11, or [24|, Prop. B.e.fl) Our task is to show that 
the contribution of ghj\{P,Q) must be large. Without it, we would have 
only the angle of arccos ^ mentioned in the introduction, as opposed to an 
angle of arccos (We would not, in fact, be able to do any better than 
arccos ^ if we did not know that P and Q are integral.) 

We will argue that, since P and Q are 5'-integral, their heights are made 
almost entirely out of the contributions of the local heights A^, v G S, 
and that these contributions, minus 5, are also present in h/\{P,Q). Then 
we will examine the contribution of the places outside S to /ia(-PjQ); the 
expression Y2weML\SL '^^ ™3'X(Aa,io(-Pi Q)i 0) will give a lower bound to this 
contribution. 

Write hi^{P,Q) = T.n,du,\^AP^Q) + Oc{l) (as in, say, [24], Thm. 
B. 8. 1(e)). By [47j, Prop. 3.1(b), every X^^w satisfies 

(3.4) Aa,«,(P, Q) > mm{\A,M P), ^A,M Q))- 
We have 

(3.5) Xa,UR^ P) = XiR)AP)^ ^A,«,(ii, Q) = A(j?),^(Q) 

by [37], Prop. 3.1(d). Thus hA{P,Q) is at least 
(3.6) 

max ^ dy,X(^ji),^{P), ^ d^„A(R)^^(Q) -5+ ^ d«,AA,^(P,Q) 

\weSL w&Sl J w(^Ml\Sl 

plus Oc{l). 

We must first show that YlweSL '^wX(^ji)^uj{P) equals /i(j:j)(P) plus a con- 
stant, and similarly for /i(_r)((5). Let w S Ml\Sl. If w is non-archimedean 
and C has good reduction at w, the height A(^)^u,(P) (resp.x A(r)^^„(Q)) is 
given by the intersection product {R ■ P) (resp. {R ■ Q)) on the reduced 
curve C (P^], (3.7)). Since P and Q are integral with respect to (i?), 
both {R ■ P) and {R ■ Q) are 0. Hence 

\r)AP) = \r)AQ) = ^- 
Consider now the case where w is archimedean or C has bad reduction at 
It;. Choose any rational function f on C whose zero divisor is a non-zero 
multiple of R. Since P and Q are integral, both 1/(^)1^; and \f{Q)\w are 
> 1. By functoriality ([24], Thm. B. 8. 1(c)) and the fact that, under the 
standard definition of the local height on the projective line, A(o),u,(a;) = 



There is a factor of i missing before ?»CxC, aC^'i Q) in |24| : cf. |24| . top of p. 218. Note that, 
as |24| states, 113.31 1 is valid even for g = 1. 
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for any x = {xq,xi) on with > x (see, e.g., [M], Ex. B.8.4), it 

^ w 

follows that 



(3.7) A(R),^(P) = Oc7,r,lJ1), \r)AQ) = Oc,r,lJ1)- 

Every place w oi L that is archimedean or of bad reduction must lie above 
a place v of K that is archimedean or of bad reduction. Since there are 
only finitely many such v, and finitely many extensions w of degree at most 
d of each of them (see, e.g., |32j . Ch. II, Prop. 14), we conclude that 

,R,K,d{^), 



(3.8) -1^^ 

h{R){Q) = 2^ du,X(R)^n,{Q) +Oc,R,K,di'^)- 

Now, again by an expression in terms of intersection products, AA,to(-P, Q) 
is non-negative at all non-archimedean places w where C has good reduc- 
tion, and, by (j3.4p . (j3.5p and (j3.7p . it is bounded below by Oc,A,Lu,(l) at 
all other places w. We use both these facts and (|3.8p to bound p.6p from 
below, and we obtain that h/\{P, Q) is at least 

max(/i(^)(P), /i(^)(Q)) - 5 + ^ d^^, max(AA,t«(-P, Q), 0) 

+ Oc,K,R,d{'^)- 

By the argument at the bottom of p. 217 in [23] with R instead of Pq, we 
have |P|2 < g{l + e) + Oc(l), IQP < 5(1 + e) V)(Q) + Oc(l). We 

apply ()3.3p and are done. □ 



The general applicability of Thm. l3.1h s somewhat limited by the presence 
of a term Oc,K,e,d,R,Po{^) depending on the curve C. (For the application 
in this paper, it will be good enough to know that Oc,K,t,d,R,Poi^) does not 
depend on L, but just on its degree d = deg{L/K).) The main obstacle 
to a uniformisation in the style of [23], Prop. 3.4, seems to be a technical 
one: we would need explicit expressions for local heights at places of bad 
reduction, and the expressions available for genus g > I are not explicit 
enough. 

4. Counting points on curves 

We must now clothe ^in concrete language for the sake of our particular 
application. Since the field L in Thm. 13.11 will now be of the special form 
we will be able to give a bound rank(J(L)) in terms of the 
number of prime divisors of t by means of a simple descent argument. We 
will then combine Thm. [3T] with sphere-packing results to give a low bound 
( ()4.ip ) on the number of solutions to ty'^~^ = f{x) with t fixed and typical. 
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Lemma 4.1. Let A{n,6) be the maximal number of points that can be 
arranged on the unit sphere o/M" with angular separation no smaller than 
9. Then, for e > 0, 

1 vr 
lim - log2 A{n, - - e) = 0(e). 

n^oo n Z 

Proof. Immediate from standard sphere-packing bounds; see [ST] (or the 
expositions in f35] and [7], Ch. 9) for stronger statements. In particular, 
0(e) could be replaced by 0(e^loge~^). □ 

When we speak of the rank of a curve over a field K, we mean, as is 
usual, the rank of the abelian group of ii'-rational points on its Jacobian. 

Lemma 4.2. Let f £ Z[x] be a polynomial of degree d >3 without repeated 
roots. Let p be a prime that does not divide d. Let K/Q be a number field. 
Then, for any non-zero integer t, the curve 

Ct : tyP = fix) 

has rank over K at most d{p — : Q] • Lv{t) + Ox,/,p(l)- 

Proof. Let J be the Jacobian of Ct. Let (p be the endomorphism 1 — r of 
J, where r is the map on J induced by the map (x,y) i-^ {x,Cpy) on Q. 
By [B], Cor. 3.7 and Prop. 3.8, 

rankz(J(K)) < |^^|-y^ rank^/p^( J(K(Cp))/0J(K(Cp))). 

By the proof of the weak Mordell-Weil theorem, J (K {(p)) / (pj {K {(p)) in- 
jects into H^{K{(p), J[(l)\; S), where S is any set of places of K{Cp) con- 
taining all places where Ct has bad reduction in addition to a fixed set 
of places. By [Sj, Prop. 3.4, the rank of H\K{Cp), J[(j)]; S) over Z/pZ 
is no greater than the rank of L{Sl,p), where L = K{(p)[T]/(tP~^ f{T)) 
and 5l is the set of places of L lying over S. (Here L{Sl,p) is the sub- 
group of L* / L*'P consisting of the classes modL*^ represented by elements 
of L* whose valuations at all places outside Sl are trivial.) As the roots of 
tP~^f{x) = are independent of t, so is L. Thus, the rank of L{Sl,p) is 
+0_ft',/,p(l) < d-\S\ -\-Okj,p{^), where the term OKj,pi^) comes from 
the size of the class group of L and from the rank of the group of units 
of L. The number of places of bad reduction of Ct over K{^p) is at most 
[K{(p) : Q]u){t) + Oxj,p{l), where OK, f,p{^) stands for the number of prime 
ideals of i^(Cp) dividing the discriminant of /. The statement follows. □ 

Proposition 4.3. Let f £ 'L\x\ be a polynomial of degree d > 3 with no 
repeated roots. Let k > 2 be an integer such that k | d°° . Let t < X be a 
positive integer. Suppose that t has an integer divisor to ^ X^'"", e > 0, 
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such that gcd(to, {k{Disc f))°^) is less than a constant c. Then the number 
of integer solutions to ty^ = f{x) with X^^*" < x < X is at most 



(4.1) 0/AcJeO/.^(-W)nP(p) 

\ p\to 

where uj(t) is the number of prime divisors oft and p{p) is the number of 
solutions to f{x) = mod p. 

The divisor tQ\t here plays essentially the same role as the ideal ^ in the 
proof of Thm. 3.8 in [23j. The main difference is that, in our present case, 
the congruence f{x) = mod to rnakes the cost of considering all possible 
congruence classes x mod to quite negligible. 

The case k\d°°, k not a power of 2 (or, in general, k such that gcd(A;, d) > 
2) is covered by the recent work of Corvaja and Zannier ([8], Cor. 2). Be 
that as it may, we will need only the case k \ d°° , and thus will not use [8] . 
We could, at any rate, modify Lem. HT2] to cover the case p\d by using [39], 
§13, instead of [S], §3. Proposition 14.31 would then cover the case k\d°°. 



Proof of Prop. \4-S\ Choose a prime q dividing k but not d. Define K = Q, 
L = Q(t-'^/'?). Let 5 and Sl be the sets of archimedean places of Q and 
L, respectively. Consider the curve C : = f{x). Denote the point at 
infinity on C by oo. Embed C into its Jacobian by means of the map 
P^{P)- (oo). 

Now consider any two distinct solutions {xo,yo), (xi,yi) to ty^ = f{x) 
with X^~'' < xo,xi < X and xq = xi mod to- Then the points P = 
(xo, t-'^/'^yo)) Q = {xi,t^/'^yi) on C are integral with respect to 5^ and (oo). 
We intend to apply Thm. 13.11 and thus must estimate the quantities on the 
right side of (|3.ip . 

By the additivity and functoriality of the local height ([24J, Thm B.8.1, 
(b) and (c)) and the fact that the point at infinity on P-*^ lifts back to g • oo 
on C under the map (a;, y) ^ x, 

^logX + 0/,g,^(l) < Aoo,«,(P) < i^logX + 0/,,,^(l), 
q q 

i^logX + 0/,g,^(l) < \ooAQ) < ^logX + 0/,g,^(l) 

for w G Sl. We know that ||P|2 - 5/i^(P)| < eh^dP) + Oc{l) and - 
ghoo{Q)\ < e^oo(Q) + Oc(l) (vd., e.g., the argument at the bottom of p. 
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217 in fMl). Hence 



(4.2) 



q q 



q 

Since P = {xQ,t^/'^yo), Q = {xi^t^/'^yi) and tQ\t, we have that, for every 
non-archimedean place w G Ml where C has good reduction, X^^^^iP, Q) > 

— log Itg'^'^U (see, e.g., [MJ, p. 209). Thus, for every prime p|to where C has 
good reduction, 

J^d^AA,^(P,Q) > -Y,d^log\tl% = V^(*o), 

w\p w\p 

where = [L^ : Qp] / [L : Q] . We apply Thm. 13.11 and obtain 



{P,Q) < ^^^^^{glogX + glogX) 



(4.3) 



^i-^<7logX + - logX - V logp-''^*^) + Oy,fe,,(l) 
2gq q 2q ^ 



0( - logX) +0,j,fc,,(l), 



where we use the facts that to ^ X^^^ and that the sum of logp''''^*") over 
all primes p of bad reduction is bounded above by the constant c. 

By (j4.2p and (|4.3p . we conclude that, for X large enough (in terms of c, 
/, A; and e), P and Q are separated by an angle of at least 7r/2 — Oj fc(e) in 
the Mordell-Weil lattice J{L) endowed with the inner product (•, •) induced 
by the theta divisor. By Lemma l4.H there can be at most e*^^*^^^ points in 
R'' separated by angles of at least tt/2 — 0(e). Since the rank r of J{L) is 
bounded from above by Of^k{'~^{t)) (Lemma l4.2p . it follows that there can be 
at most e'^-'''*'^^^^*)^ points placed as P and Q are, viz., satisfying X^~'' < x < 
X and having x-coordinates congruent to each other modulo to- Since ty'^ = 
f{x) implies /(x) = mod to j there are at most O/dlpit,, P^P)) congruence 
classes modulo to into which x may fall. □ 

5. The probability of large deviations 

Our task in this section will be to translate into number theory a state- 
ment (Sanov's theorem, [33]) on the probability of unlikely events. (If a die 
is thrown into the air n times, where n is large, what is the order of the 
probability that there will be fewer than ^ ones and more than ^ sixes? 
The central limit theorem does not yield the answer; it only tells us that 
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the probability goes to zero as n goes to infinity.) The translation resem- 
bles the argument in |12j, though some of the intermediate results must be 
sharpened. 

Let J be a finite index set. For c, x € (M^)'^, define 
(5.1) 

Bs,x = {y G (1^0 • sgn(yj - xj) = sgn{xj - cj) Vj G J s.t. xj / cj}, 

where sgn(i) is as follows: sgn(t) = 1 if t > 0, sgn(f) = — 1 if t < 0, and 
sgn(t) = if t = 0. In other words, B^g is the set of all vectors y that are 
no closer to c than x is: i/j < xj if Xj < Cj, and yj > Xj if Xj > cj. We also 
define 

(5.2) 4-(f ) = l-J2xj+Y,^j log ^ • 

We adopt the convention that, if cj = 0, then log ^ = oo, unless xj also 
equals 0, in which case we leave log — undetermined and take Xj log — to 

Cj J Cj 

be 0. 

The following is a variant of Sanov's theorem. 

Proposition 5.1. Let the rational primes he partitioned into {Pjjjgj, J 
finite, so that, for every j G J, we have the asymptoptic "^p^p. p<jv 1/p ~ 
log log iV, where f £ (M+)"'. Let {Xp}p 

prime ^e jointly independent ran- 



r 



jiug,iug,jv, I c v^^o / ■ ^'-'^ l^'^pSp prime 

dom variables with values in (M^)'^ defined by 

\ p ■ 

(5.3) Xp 



Cj with probability sj/p, 
with probability 1 — Sj/p, 



where s G (Kq^)"', ej is the jth unit vector in R*^ and j £ J is the index 
such that p £ Pj. 

Define c £ (Mq )'' by Cj = rjsj. Then, for all x £ {^qY , 

lim log P V dx, £ i?e>- = -W), 

n^oo log log n \ log log n ^ j 

where Is{x) is as in I15.S\) and 5g denotes the point mass at x G M'^. 

Proof For m > 0, let = ^ Ep<e^^ ^x,- Define </>™(t ) = E (^e^**'^-)) 
for t£R-^. Then 



<^™(mf)=E(e<-*>'"))=n n + 



pGPj 
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Define A(f) = linim^oo ^ log 0m ("i**) • We obtain 
*(J> = E.>1- E log + 

Write A*(y) for tlie Legendre transform sup^^^j{{y,t) — A(t)) of A(i). 
For y G (K^)"^ with yj = for every j & J with Cj = 0, the maximum of 
{y,t) — A(t) is attained at all t G (K^)"^ such that tj = log |^ for every 
j G J with Cj / 0. Thus, inf^^g^^ A*(y) equals 

/ \ 



inf 

Cj=0=>?;j=0 



1- + 



(The equation is valid even if Cj = for some j E J, thanks to our con- 
vention that Xj log{xj/cj) = when Xj = Cj = 0. For y G (l^d)"^ such that 
yj 7^ 0, Cj = for some j G J, the function t {y,i^ — A{i) is unbounded 
above, and so A*(y) = oo.) By the Gartner-Ehis theorem (see, e.g., [25] . 
Thm. V.6, or JU], Thm. 2.3.6), we conclude that 



hm -log(P(Z„ G Bs,^)) = -h{x). 

m—>oo Ttl 



□ 



The following lemma serves a double purpose. It is a crucial step in the 
translation of a probabilistic large-deviation result (in our case. Prop. 15. ip 
into arithmetic (cf. [12], Lemma 4). Later, it will also allow us to apply 
Prop. 14.31 in such as way as to get a bound of (logd)'^ for the number of 
integral points of moderate height on the curve dy^~^ = f{x), where d is 
any integer outside a sparse exceptional set. 

Lemma 5.2. Let f £ TL\x\ he a polynomial. Then, for any A > 0, e > 0, 

there is a function 5/,A,e : (e, oo) — > [0,1] with |log(5(x)| < eloglogx and 
6{x) = o(l/loglogx), such that, for all but Oj^A,e(A^(log A^)~^) integers n 
between 1 and N , 

(a) Wp\f(^n):p<N^W P < ^^ 

(b) J2p\f{n):p>N^(N) 1 + Ep2|/(n):p<7V'5('V) 1 < £ log log A^. 

In other words, the bulk in number of the divisors is on one side, and 
the bulk in size is on the other side. All but very few of the prime divisors 
of a typical number are small, but their product usually amounts to very 
little. 
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Proof. Define j{n) = llp\f(ny.p<N^W P- Let 5{x) = (logx) where 
r = deg(/) and c will be set later in terms of A and e. Then, for any 
positive integer k and all N such that 5{N) < ^, 

^ (log77v(n))'= = Yl 

l<n<N l<n<N \p\f{n):p<NSW 



<^kj N max 




r \ogp 



^3r 



l<j<k \ ^ p 

<^r,k Af(logiV'^(^))'= = iV(logA^)^-^'=/'''^' 

Setting k = lArc^^/e], we obtain that there are Oc,/,A,e (-^(log A^)""^) inte- 
gers n from 1 to such that 77v('^) > -A^^- Thus (jaj) is fulfilled. 
Clearly 

c'^C/W/TivW) = (c2r^c^(/(n)/7jv{n))/(2r) < ^^^^^ g2ra;(d)^ 

d sq.-free, d<C^/N 
d\f{n)/'yN{n) 

where C is the absolute value of the largest coefficient of /. Hence 

^ („2r^\iu{d) 
^C^{/{«)/7iv{n)) < ^2™(d)^^^. Y ^ ^ 

n=l l<n<N d sq.-free d<C\/N 

d<CVN p\d^p>NSW 
d\f{n)h{n) 



li uj{f{n)/-iN{n)) > eloglogiV, then c'^(/W/7iv(n)) > (bg iV)^i°g^ We 
set c = [e~^"^] and conclude that u){f{n)/^N{n)) > e log log for only 
0/^A^e(A^(log A^)""^) integers n from 1 to A^. □ 

Now we will translate Prop. 15.11 into number theory. It may seem sur- 
prising that such a thing is possible, as Prop. [5TT] assumes that the random 
variables it is given are jointly independent. We will be working with the 
random variables Xp, where Xp = 1 if p divides a random positive inte- 
ger n < N, and Xp = otherwise; the indices p range across all primes 
p < z, where z is such that log log z > (1 — e) log log A^. While the variables 
Xp are very nearly pairwise independent, they are far from being jointly 
independent. (Even if z were as low as (log A^)^, they would not be.) 

Fortunately, the events Xp = 1 are so rare {F{X.p = 1) = i) that, for 
a typical n < N, the product d of all p < z such that Xp = 1 is at most 
A^*^. Since d < N^, the variables Xp, p\d, are jointly independent (up to a 
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negligible error term). One cannot rush to conclusions, of course, since d 
depends on the values taken by the variables Xp. Nevertheless, a careful 
analysis gives us the same final result as if all variables Xp, p < z, were 
jointly independent. This procedure is not new; it goes back in essence to 
Erdos and Kac ([T2]). 

Proposition 5.3. Let f £ Z[x] be a non-constant polynomial irreducible 
over Q. Let the rational primes be partitioned into {Pjjjgj, J finite, so 
that, for every j G J, we have the asymptotic '^p^p. p<jv 1/p ~ fj log log A'", 
where f E (Mjj')"'. Assume furthermore that, for all p G Pj, the equation 
f{x) = modp has exactly Sj solutions in Z/pZ, where s G (Tj^)"^ . Let 
ujj{n) be the number of divisors of n in Pj. 



Let Cj = rjSj for j G J. For every x G (R^)-^, let 
Ss,s{N) = {l<n<N: {uJ,{f{n)) - x, log log N) ■ {xj - c,) > Vj G J}. 
Then, for all x € {R^Y , 



Proof (Cf. [12], §4.) Let P{z) = l\p<zP- For d\P{z), let Sd^N) = {1 < 



n < N : gcd(/(n), P(z)) = d}. Applying Lemma 15.21 with f{n) = n, we 
obtain, for A arbitrarily large and e > arbitrarily small. 



where we let z = N^^^^ and set G (0, 1) arbitrarily (say <j = 1/2). We 
will set and use e later; for now, it is hidden in the properties that the 
statement of Lemma 15.21 ensures for the function 5 it has just defined. By 
the fundamental lemma of sieve theory (vd., e.g., [30], Lemma 6.3, or [17j . 
§3.3, Cor. 1.1) and the fact that Lemma [5^2] gives us 6{x) = o(l/ log log x), 
we have, for all d < N'^, 



(5.5) i5,,,(iv)i=(i+o((iogiv)-^)).-n n i^-'^/p)- 



(We use the fact that \{1 < n < N : gcd(/(n), P(z)) = 4| equals |{1 < 
n < N/d : gcd{ f {n), P{z)/d) = 1}|, and estimate the latter quantity by 
a sieve such as Brun's or Rosser-Iwaniec's; we know that the sieve gives 
us asymptotics with a good error term (namely, (logA^)""^) thanks to the 
fundamental lemma.) 




where L^{x) is as in 15. 



(5.4) 




d\P{z) 
d>N'^ 



j&J p<z:'p\d 
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Define the jointly independent random variables {Xp}p prime as in (j5.3p . 
For d\P{z), let Sd,z be the probability that Xp / for all p\d and Xp = for 
all p\P{z)/d. By inclusion-exclusion, Sd,z = a UjeJ Up<z,p\d:p&p,i'^ - Sj/p)- 
Thus 

(5.6) \Sd,z\ = N{1 + 0((logiV)-^)) • Sd,z 

for d < A^*^, and 



d\P{z) d\P{z) 

d>m d<m 
(5.7) =l-(l + 0((logiV)-^)).l ^ 

d|P(z):(i<Af'; 

= O ((logiV)-^) +1 X = O ((logiV)-^) , 

(i|P(2):d>Ar'; 

where we are using (15. 4p in the last line. 

Since the variables Xp are jointly independent, we may apply Prop. 15. 1|. 
and obtain 

lim - — I log f y~] A{d,z)sd,z | = -Ic{x), 

- log log z I ^-^ I 

^ ^ \d\Piz) J 



z—^oo 



where A((i, z) = 1 if | \ol\ogz } j ^ ^c,x and A(d, z) = otherwise. By 
(j53]l . (USD and ([521), it follows that 



^ ^ A(d,z)5,,.(iV)) =-/,(. 

\ d\P{z) J 



(5.8) lim , , 

Af^oo log lO; 

for z = N^^^^ and A sufficiently large, provided that /^(x) be finite. If 
Is{x) = oo, we obtain (j5.8p with lim replaced by lim sup and = /^(x) 
replaced by < —A. 

Lemma E2] states that |log5(A^)| < e log log A^. Thus log log z > (1 — 
e) log log A^. By Lemma 15. 2lfb]l . 

(5.9) J^|?i;,(gcd(/(n),P(z))) -«;,(/(n))| < eloglogiV 

for all but 0(A^(log A^)~^) integers n between 1 and A^. 
We conclude from (jS.Sp and (|5.9p that, if /^(x) is finite, 



(5. 



loglogAT \^Ar^^ J 
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where we use the fact that /^(x) is continuous with respect to the coordinate 
Xj of X when xj ^ Cj, and the fact that the projection of B^^^; onto the j'th 
axis is M when xj = cj. We let e — > and are done. 

Suppose now that I^ix) = oo. We then have (jS.lOp with < —A+Os^.g{e) + 
0/(1) instead of —I^{x) + 0^^^{e) + o/(l). We let A — > oo and e — > 0, and 
are done. □ 

It is easy to generalise Prop. 15.31 so as to let the argument n of /(n) 
range over tight sets other than the integers. (See Def. 11.31 for the defi- 
nition of a tight set.) The means of the generalisation will be based on a 
view of sieves that may be unfamiliar to some readers and thus merits an 
introduction. We will use an upper-bound sieve to provide a majorisation 
of the characteristic function of a tight set (such as the primes). We will 
then use this majorisation as a model for the tight set, instead of using 
it directly to obtain upper bounds on the number of elements in the tight 
set. This model will have the virtue of being very evenly distributed across 
arithmetic progressions. 

We recall that an upper-bound siev^of level D \s a. sequence {\d\i<d<D 
with Ad = 1 and X^d|„ > 0. Since A^ has support on {1, 2, . . . , L>}, 
we have 'Yld\p = ^ for every prime p > D. Thus g{n) = X]d|n -^rf 
jorises the characteristic function of {p prime : p > D}. In general, if 
Xd is supported on{l < d < D : p\d ^ p £ P}, where P is some 
set of primes, g{n) = X^d|„ A^ majorises the characteristic function of 
{n G Z+ : p|n ^ (p ^ P V p > D)}. 

If 5 C Z+ is a (P,0)-tight set (vd. Def. [O]), then S n [N^/^,N] is 
contained in {n £ Z+ : p\n =^ {p ^ P V p > N^/"^)^, where 5 > 
is as in Def. 11.31 We set D = N^/"^^ and obtain that g{n) majorises 
the characteristic function of S n [N'^/'^,N]. Any good upper-bound sieve 
(such as Selberg's or Rosser-Iwaniec's) amounts to a choice of A^ such that 
Yln<N di''^) ^ -^/(log-^)^! where 9 is the dimension of the sieving set P 
(see Def. II. 2p . Now, by Def. 11.31 the fact that S is tight implies that 
\Sn[N^/^,N]\ » iV/(log7V)'^. Thus 

(5.11) \Sn[N^/^,N]\ < g{n) « \S D [N^/^ , N]\. 

n<N 

In other words, g{n) is not just any majorisation of the characteristic func- 
tion of S'n [N^^'^,N], but a tight one, up to a constant factor. 



^ Take, for example, Selberg's sieve A^j. We are using the notation in |30| . §6, and so, by 
Xd, we mean the sieve coefficients, and not the parameters (call them p^j, as in 1301 ) such that 

^d\n ~ (X]d|n Pd^ ■ III .17, and some of the older literature, the symbols stand for what 

we have just denoted by p^j. 



Power-free values, large deviations, and integer points on irrational curves 21 



Proposition 5.4. Let f, Pj, J, f, s, coj and c be as in Prop. \5.3[ Let 

S CZ be a {P,e)-tight set. For every x E define Si^{N) to be 

(5.12) 

{n£S ■.l<n<N, (ujjifin)) - xj log log TV) • {xj - cj) > Vj E J}. 
Then, for all x e {R^Y , 

<'-l'> '"^ ( |(n e S : l'< » < A^ll l^'>-'^'l) ^ -^--'-'>- 



where Is{x) is as in \5.2\) and 5^ is as in Prop. \5.1[ 

The lower bound on the rate of convergence of (j5.13p that can be made 
explicit from the proof below depends on the constants in Def. 11.31 (that 
is, on 6 and the implied constant in the said definition) but not otherwise 
on {P,9). (By a lower bound on the rate of convergence we mean a map 
€ such that the left side of (j5.13p is within e of the right side for all 

N > N,.) 

The proof of Prop. 15.41 is essentially the same as that of Prop. 15. 3t we 
limit ourselves to sketching the argument again and detailing the changes. 



Proof of Prop. \5.4\ Choose an upper-bound sieve of level N'^, < cr < 1, 
with {p G P : p < }, < cr' < 5/2, as its sieving set, where P and 5 
are as in the definition of {P, ^)-tight sets. (For example, choose A^^ to be 
Selberg's sieve. See, e.g., [30], §6.) The proof of Prop. [531 goes through 
as before if one assigns the multiplicities X]d|n to the elements n of 
{1,2,..., A^}, S^^£ and 5^^^. (Choose ? < 1 — a. Redo Lem. 15.21 taking 
into account the new multiplicities. The crucial fact is that the natural 
estimates for X]i<n<iV r|n Sd|n '^d ir given) have very good error terms. 
The irreducibility of / helps us in so far as f{x) = modp, p\x are both 
true for a finite number of primes p, if for any.) We obtain the statement 
of Prop. 15. 3^ with 

(5.14) hm ■ I log (- ^„ ■ \Ss,,\\ = -Isix) 

TV^oologlogiV \El<n<NEd\n^d J 

as the result, and S^^^ counting n with the multiplicity J2d\n '^d- Since 
^^1^ Ad majorises the characteristic function of 5 n [N'^/'^,N], we have 
|%| < \Ss,s\+N^/^. At the same time, as in ^JB), J2n<NJ2d\n^d < 
\Sn[N^/^,N]\. Hence ([SHjl implies (f5l^ . □ 

For S equal to the set of all primes, we could replace limsup and < in 
(j5.12p by sup and = through an appeal to Bombieri- Vinogradov. However, 
we shall not need such an improvement. 
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6. Proof of the main theorem and immediate consequences 

Using the results in ^ we will now show that, if tp^ = f{n) for some 
n € Sn [1, N], some prime p and some t not much larger than N, then either 
n is atypical or t is atypical. Since "atypical" means "rare", we conclude, 
counting either n's or t's, that few n & S H [1, N] satisfy tp^ = f{n) for 
some p prime and some integer t not much larger than A^. 

For this argument to yield anything of use to us, we must make it quan- 
titative and rather precise. It is here that entropies come into play, as 
they appear in the exponents of expressions for the probabilities of unlikely 
events. 

Proposition 6.1. Let f G be a polynomial of degree d > 3 ir- 

reducible over Q. Let S be a {P,9)-tight set of integers. Define J to 

be the set of conjugacy classes (g) of Galj. Let c = 1 1 jjf j j. 1 1 ^ ^ '^n-d 

c' = 1 1 Gaij I •^(g) } ^ ^ ' where A^^^ is the number of roots of f{x) = fixed 
by g £ Galj. Define 

(6.1) 7 = min (max ( — h -lAx), 9 + I^{x 

xex \ \ d d 

where X = Xjgj[min(cj, c^), max(cj, c^)] and L^{x), /^(x) are as in 115. S^) . 
Then 

(6.2) ^\{n e Sn[l,N]: 3p prime, p > such that p'^~'^\f{n)}\ 

is 0((log A^)~'^+'^) for every e > 0. The implied constant depends only on 
f, 9, e, and the constants in Def. \1.3\ for the given set S. 

Proof. There is a t > such that X C [0, tY and /^(x) > 7 for all elements 
X of {x £ O^oY : Xj > t for some j e J}. Let Y = [0,ty. Cover Y by 
all sets of the form B^^rg (see (jS.ip ) with x such that + ^-^^(x) > 7 — e, 
and all sets of the form B^^g with x such that 9 + I^{x) > 7 — e. Such sets 
form a cover of Y by (j6.ip . Since Y is compact and all sets B^^^, B^^^ in 
the cover are open, there is a finite subcover we may choose one such 
finite subcover in a way that depends only on c, , d and 9, and thus only 
on / and e. Write ^ = [J^^x ^c,x U UxeX' ^S',x- 

Define to be the union of ^ and the collection of all sets B^^^ with 
X such that Xk = t for some k £ J and xj = c'j for all j ^ k. Then is a 
cover of (M^)"^. In particular, for every n G Sn [1, A^] such that p'^~^\f{n) 
for some prime p > N'', we have {t(;j(/(n))/loglog A'"}j G B for some B in 

. The set B may be of type B = B^g or B = B^ In the latter case, 
(j5.12p holds with cf instead of c, and so, by Prop. 15.41 n belongs to a set 
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Si, ^{N) whose cardinality is bounded above by a constant times 
(6.3) 

(logA^)-^^'(^)+^ • |5n [l,iV]| <5 iV(logA^)-^^'(^)"^+^ < N{logN)-''+^\ 

where 5 is as in Def. 11.31 (Here we are assuming, as we may, that N 
is larger than some constant depending only on /, e, 5, and the implicit 
constant in Def. 11.31 As in further applications of Prop. [5^31 and Prop. 15.41 
we define the sets P^g^ to consist of the primes p f Disc(/) with specified 
Frobenius element Frobp = {g) E J; we put each prime p\ Disc(/) in its 
own exceptional set Po,p- The exceptional sets will have no infiuence on 
the bounds. The densities hmAr^oo logiogjv Yl^peP ,p<N ^/P °f sets Pj 
are given by the Chebotarev density theorem.) 

Consider the other possibility, namely, that {wj{f{n))} is in a set B^g. 
Let p be a prime > N'^^^ such that p^~^\f{n), and define r = f{n)/p^^^. 
Suppose first that \r\ > N{logN)~°' , where a will be set later. Then 
/-I <^ N'^/\r\ < Ar'^-i(log7V)", and so p <C 7V(log A^)"/('^-^). For every p, 
the number of positive integers n < N with p'^~^\f{n) is < Of{\N/p'^~^~\). 
Since d > 3, 



(6.4) Af^<p<Ar(logAr)"/{<^-i) 



N 



N 



AfE<p<Ar(iogAf)°=/(d-i) 
< N^-' + A^(log AT) VC"!-!)-! . 



Choose a = ^(1 - I,-(x)). Then ^ - I = - + i/.(f)) < -7 + e, 
and thus the contribution of all n with rp'^~^ = f{n), \r\ > A'"(log A^)~", is 
at most Of{N {log N)-^+'). 

Now take the remaining possibility, namely, |r| < A^(logA^)~". Choose 
an integer divisor A: > 1 of d — 1. Let w be the product of q^^i^'^^^i'^ over 
all primes q dividing kDisc{f). Let y = wp^'^~^^^'' , t = r/w^. Then ty^ = 
f{n). Moreover, \t\ < A^(logiV)-°, gcd(t, (/c Disc(/))°^) < (A: Disc (/))'= = 
0/^fc(l), and the number of prime divisors uj{y) of y is also 0/^^(1). 

We may assume without loss of generality that the leading coefficient of 
/ is positive, and thus /(n) will be positive for n larger than some constant 
Of{l). Since y is also positive, t is positive as well. For every j S J, we 
know that u>j{y) = 0/(1) and ujj{f{n)) — ijJj{y^) < ujj{f{n)/y^) = Wj(t) < 
LOj{f{n)). Hence 

u^jif{n))-Ofil)<iVj{t)<Uj{f{n)). 

Define x' G (M^^)-^ by x'j = cj + (1 - e){xj - cj). Then Is{x') > Ig{x) -Of{e) 
and {wj{t)}j G B^{x') for n larger than some constant 0/(1). We may 
ignore all n smaller than 0/(1), as they will contribute at most 0/(l) to 
the final bound on (16.21). 



24 



H. A. Helfgott 



We apply Prop. [5T3l with /(x) = x and NilogN) " instead of N. We 
obtain that t hes in a set S' = Sg^^'{N (log N)~°') of cardinality at most 

(6.5) (logiV)--^=(^')+' • |Zn [l,Af(logA^)-"]| = iV(logA^)-■^^(^)-°+^ 

provided that, as we may assume, N is larger than some constant 0/(l). 

Our task is to bound, for each t £ S' U [1, A^(log A^)~"], how many so- 
lutions {n,y) G (Z^)^ with n < N the equation ty^ = f{n) has. (We are 
also given that gcd(t, (A; Disc(/))°°) is bounded above by Oj^fc(l).) Of S' 
we need only remember that it is a subset of Z"*" with cardinality at most 
(I63|). 

Let 5{x) be as in Lemma [5.2l with A equal to 7 (or greater). Assume that 
n is such that (jaj) and ^ in Lemma 15.21 both hold. (By the said Lemma, 
we are thereby excluding at most Of^A,e{^{^og N)^"^) values of n.) We 
may also assume that f{n) has at most OA(loglogn) prime divisors and 
exclude thereby at most 0(A''(log A^)~^) values of n. We may also assume 
that n > A^^~^ (and exclude an additional set of A''^"'^ values of n). Apply 
Prop. with to = t/ti, where ti is the product of all primes p\f{n) such 
that p < N^W. We obtain that there are at most 0/,fc,,((log A)'^/.'''^(')) 
possible values of n for every value of t. 

Since the number of values of t under consideration is bounded by (|6.5p 
and a has been chosen so that —I^{x) — a = — — \lc{x) < —7 + 0/(e), 
we conclude that there are at most 
(6.6) 

0(A(logA)-^) +0/,fc,,(|5'|(logA)0/.^.AW) NilogNy-f+OfA^) 

solutions (n,y) G (Z"*")^ with n < A to a given equation ty^ = f{n) with 
teS'U [1, A(log A)-" and gcd(t, (k Disc(/))°°) = 0/,fc(l). 

We add the bounds (16. 3p , (|6.5p , (16. 4p and (16. 6p over all elements of the 
cover . Since the cardinality of the cover depends only on f, 6 and e, we 
are done. □ 

Proposition 16.11 was the ultimate purpose of all of the work that came 
before it. The following lemma is far softer. 

Lemma 6.2. Let f £ 'L\x\ be a polynomial of degree d > 1 irreducible over 
Qlx]. Let k > max(d, 2). Then there is a 5 > depending only on d such 
that, for every D > 1, 
(6.7) 

\{1 < n < N : 3p prime, p > D s.t. /|/(n)}| </ A(A"'^ + D"^''"^)). 

Proof. Suppose k > d. If p^|/(?^), then p <^f N'^k. The number of positive 
integers n < N such that p^\f{n) for some prime D < p <^f N*^/^ can be 
shown to be Of{N'^/^) by the same simple argument as in (j6.4p . We set 
5 = 1 — ^ and are done. 
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Suppose now k = d. Again as in (j6.4p . the number of integers n £ [1, iV] 
such that p^\f{n) for some prime D < p <C/ N^~''/^, e > 0, is at most 
Of{N^~T^ + ND^'^'^^^^). If p^|/(n) for some prime p > N^^"/'^, then rp^ = 
f{n), where r is an integer with |r| <tif N"^. Thus, we need only show that, 
for every integer r with |r| N"^, 

(6.8) \{l <n< N :3p prime such that rp'' = f{n)}\ < iV^"^' 

provided that e be sufficiently small. This is an easy bound; a much stronger 
one (viz., ^ A^i/'^ instead of ^ N^~'^'') follows immediately from [3], Thm. 
5. Set 5 = e/k. □ 

Remark. We can actually replace the bound ^ N^^'^^ in the right side of 
(j6.8p by <C , with e' > arbitrarily small, provided that deg(/) > 1. (If 
deg(/) = 1, we have instead ^ {N/ry^^ / log{{N/r)^/^), which can just as 
easily be proven as be proven best: let p vary, and define n in terms of r 
and p.) We may proceed as follows: 

(a) If deg(/) = 2 and k = 2, then the number of points {x, y) G (Z n 
[1,A^])^ on ry^ = f{x) is Oj(logA^). This is a classical result of 
Estermann's ([13], p. 654 and p. 656). (Reduce the problem to the 
case where f{x) is of the form j;^ + /, / 7^ 0, by a change of variables 
over Q. Then count the solutions (x, y) G (Zn [1, N])^ to ry^ — x"^ = I] 
they are bounded by Oi(logA^) because the group of units of Q(\/r) 
is of rank 1.) 

(b) If deg(/) > 2 or A: > 2, the genus of C : ry^ = f{x) is positive. 
Bound the rank of C(Q) by Lem. 14.21 (generalised so as to remove 
the assumption p \ d] see the comment after the statement of Prop. 
14. 3|] . Bound the number of integer solutions to ry^ = f{x) with 

a < 1, as in Prop. 14.3} the auxiliary divisor Iq 
is not needed, as we do not aim at estimates as delicate as before. 
We obtain a bound of Of^k{(}ogNY), c > fixed, for the number of 
integer points with x in the said range. Vary a as needed. 

We obtain Lemma 16.21 with 6 = — e, where e > is arbitrary. (The 
implied constant in (j6.7p then depends on e.) 

Alternatively, we could bound the number of rational solutions to ry^ = 
f{x) of height 0(log A^) by Cor. 4.3 and Lem. 4.4 of [22] and Prop. 3.6 of 
|23j , say, and then bound the number of integer solutions by the number of 
rational solutions. The resulting bound would still be 0{N^) on the average 
of r, and so we would still get 6 = -^r^ + e, e > 0. 

As it happens. Lemma 16.21 in its presently stated form (that is, with 
5 > unspecified) is all we shall need; even in the explicit result for prime 
arguments (Prop. 17. 4p . the error terms would not be affected by any im- 
provements on Lemma [621 The argument just sketched in this remark was 
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well within the reach of previously known techniques; it has been included 
only for completeness. 

We are now ready to prove the main theorem. Given Prop. 16. H what 
remains is quite straightforward. 

Proof of Main Theorem. Our main task is to show that, for every e > 0, 

(6.9) |{1 < n < iV : 3p prime, p>N^ such that /|/(n)}| = Of^s,e{N). 

Let / = c/[^ • • • fl\ where c £ ri < fk and the /j's are irreducible poly- 
nomials in Z[x] coprime to each other. Then, for q larger than a constant, 
we may have q^\f{p) only if q^\fl^ for some i S {1,2,... ,/}. Note that 
q^\fl^ implies q^^\fi, where ki equals [^], which, by the assumption in the 
statement of the theorem, is at least deg(/j) — 1. Thus, for the purpose of 
proving (j6.9p . we may assume that / is irreducible and k > deg(/) — 1. 

lik> deg(/), then (|6.9p follows immediately from Lemma l6.2i Suppose 
k = deg(/) — 1. By Prop. 16. H we need only check that 7 as defined in 16.11 
is greater than 0. Since I^{x) is continuous on x in the domain on which 
it is finite, it is enough to check that + ^^-fc(c') > ^, as it will then 
follow that + -^p^^Is{x) > 9 + e' for some e' > and any x in some open 
neighbourhood of c?, and, by (|5.2|) . 9 + I^{x) > 6 + e", e" > 0, outside that 
neighbourhood. 

We must, then, show that + ^qr[^c(c') > ^- Now, 

I,-(C-') = 1-Y1 C\g) + C[g) log 7^ = E %) \9) 

{9) (g) (9> 

= E I (9)\\g) log A<,) = ^ Yl ^9 l°g = 

where we use the fact that = 1 (by the Cauchy-Frobenius Lemma, 

or, as it is incorrectly called, Burnside's Lemma; see [37]). By one of the 
assumptions in the statement of the present theorem, If > [k + \)9 — k. 
Thus, + f > 9. We are done proving (fOj) . 
Since S is (P, 0)-tight, we have, for every p < N'', 

\{l<n<N: /|/(n)}| = 0/(iV// + 1), 

where we use an upper-bound sieve with sieving set P\{Pn {p}) to bound 
the cardinality on the left. (The bound on the right is attained by the 
definition of (P, 0)-tightness.) Thus, for any z > 0, 
(6.10) 

\{l<n< N -.Bp prime, z < p < N' s.t. /|/(n)}| = Of^N/z''-'^ + N"). 
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Let Af^k^z{]^) be the set of integers n e S H [I, N] such that p'' f f{n) for 
every p < z. Since S is predictable (see Def. II. ip . 

N-,00 sn[i,N] ^ ' N-*oc LSn i,iv 

p\m=>p<z 

m<l Q<a<m'= 
p\m^p<z /(a)=0 modm* 

where the rate of convergence depends on which is here held fixed. Let 
Af^k{N) be the set of integers n ^ S r\[l,N] such that f{n) is free of fcth 
powers. By ([63]), (l6lTO and (l6TT|) . 

m<l 0<a<m'= 
p\m^p<z /(a)=Omodm* 

for every z. Choose e = (say). We let z go to infinity with A'' as slowly 

as needed, and conclude that 

(6.12) 

\Af,,{N)\ = \Sr^[l,N]\. Y E p{a,m')+Of,s{N). 

m<l 0<a<m* 
p\m^p<z /(a)=Omodm'= 

□ 

Remark. The implied constant in (j6.12p depends only on /, on the con- 
stants in Def. 11.21 and on the rate of convergence in (jl.4p for the given set 
S (as a function of m). 

Proof of Theorem The primes are a {P, 0)-tight set with 9 = 1 and P 
equal to the set of all primes; the constant 6 in Def. 11.31 is 1. By the prime 
number theorem, the expression p{a, m) in (jl.4p equals if gcd(a, m) = 
1 and if gcd(a, m) 7^ 1. In particular, the primes are predictable. Apply 
the main theorem. Since 0(m) is multiplicative, the expression (jl.6p equals 
dH- □ 

Proof of Corollary Let c be the greatest common divisor of the coef- 
ficients of /. For every p > d + 1, the equation -;j^f{x) = modp has 

at most d < p — I solutions in Z/pZ. Hence -;;^f{n) ^ modp for 

some integer n not divisible by p. Clearly f{n) ^ modp^''^'^^'^"'^. Since 
Vp{c) < k, we conclude that f{x) ^ modp*^ has a solution in (Z/p^'E)* for 
every p > d + 1. We are given, by the assumption in the statement, that 
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f{x) ^ modp'^ has a solution in (Z/p'^Z)* for every p < d+1 as well. We 
obtain that no factor of (|1.2p is 0. 

For p sufficiently large, f{x) = uiodp^ has at most k solutions in 
(Z/p^7j)*, by Hensel's Lemma; thus 

and so we see that (|1.2p does not converge to 0. Apply Thm. II. li □ 

Now let us show that, as was remarked at the end of §1.11 the entropy 
// is greater than 1 for all normal polynomials / of degree > 3, and, in 
particular, for all / with Galj abelian and deg(/) > 3. 

Let / G be a polynomial of degree d > 3. Suppose / is normal, i.e., 
I Galj I = d. By the Cauchy-Frobenius Lemma, | q j^^^ | X^geGai/ ^9 ~ ^- 
the same time, \e = d for the identity element e G Gal/. Hence Ag = for 
every g G Galj other than the identity. So, 

(6.13) // = -i— A<;logAg = i-dlogd = logd>l. 

' ^ ' geGalj 

Note that log d is the largest entropy a polynomial of degree d can have. 

A transitive abelian group on n elements has order n (see, e.g., [45] . 
10.3.3-10.3.4). Thus, every polynomial / with Gal/ abelian is normal, 
and, by the above, its entropy // is log(deg(/)) > 1. 

Lastly, let us compute the entropy of / with Gal/ = Sn and n large. 
(A generic polynomial of degree n has Galois group Gal/ = Sn-) Let us 
define the random variable Yn to be the number of fixed points of a random 
permutation of {1, 2, . . . , n}. Then, for / irreducible with Gal/ = 

n 

If = Yf{Yn = k)k\ogk. 

k=l 

It is easy to show that the distribution of Yn tends to a Poisson distribution 
as n — > oo; in fact, by, say, [T], pp. 1567-1568, 

^1 



max 

0<fc<n 



nYn = k)-'— 



< 



2n+l 



{n + iy. 



Thus, // = Ylk=i l^klog k + o(l), and so 

oo 

(6.14) lim //• = V log k. 

?i— +00 — ' kl 



k=l 



Numerically, Xl^i ^ = 0.5734028 .... Since this is less than 1, the 

conditions of Thm. [LT] are not fulfilled for / with deg(/) = n, Gal/ = Sn, 
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n large; some simple numerics suffice to show the same (namely, If < 1) for 
/ with deg(/) = n, Galj = Sn, n small. This is unfortunate, as a generic 
polynomial of degree n has Galois group Sfi- 

7. Rates of convergence and error terms 

We now wish to bound the error terms implicit in the various cases of 
the main theorem discussed in ^ We must first compute the quantity 7 
defined in (j6.ip . 

This computation may seem familiar to those who have seen the theory 
of large deviations being used in hypothesis testing. Let us go through a 
simple example of such a use. Some believe that the variable X follows 
a certain distribution, centred at o, say; others believe it follows another 
distribution, centred at b. Both parties agree to fix a threshold c (with 
a < c < b) and take n observations of the variable X. If the sample mean 
Sn = ^{Xi + . . .+Xn) turns out to be less than c, the contest will have been 
decided in favour of the distribution centred at a; if Sn > c, the distribution 
centred at b will be held to be the correct one. The question is: where is 
the best place to set the threshold? That is, what should c be? 

Denote by ¥a{E) the probability of an event E under the assumption that 
the distribution centred at a is the correct one, and by ^b{E) the probability 
if the distribution centred at b is the correct one. Then we should set c 
so that max(]P(2 (*S*77, ^ c),IP^(»S'^ <^ c)) is minimal; that way, the likelihood 
of resolving the contest wrongly will be minimised. (We are making no 
a priori assumption as to the likelihood of either party being correct.) 
This minimum will usually be attained when ¥a{Sn > c) = ^biSn < c). 
Actually computing c is a cumbersome task; it is rare that there is a closed 
expression either for the minimum of ma,x(¥a{Sn > c),Pf,(S'„ < c)) or for 
the c for which it is attained. 

In our context, we have that any value of d in dy'^ = f{p) will be unlikely 
either as an integer or as a value of f{p) divided by the square of some 
prime. We must set a threshold of some sort and be able to say that, if 
d falls under it (in some sense), it must be unlikely as an integer, and, 
if it goes over it, it must be unlikely as a value of f{p) divided by the 
square of a prime. It will be best to set the threshold so that the maximum 
of the two likelihoods will be minimised. (Matters are complicated by 
the facts that, in our problem, one of the distributions starts "(logA^)"^ 
ahead" (0 = 1 in the case of prime argument); thus we have ()6.ip instead of 
miuc max(Pa('S'n > c),Pb(S'„ < c)).) The minimum of the maximum of the 
two likelihoods can usually be attained only when the two likelihoods are 
equal; we have to minimise them on the surface on which they are equal. 
(We will be working in several dimensions. Thus, the fact that the two 
likelihoods are equal defines a surface.) 
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Again, it is difficult to give closed expressions for all constants, but we 
shall always be able to compute all exponents - and, in some particular 
cases, fairly simple expressions can in fact be found; see the third note 
after the proof of Proposition 17.41 



Lemma 7.1. Let c, c? G (M,]')'^, J finite. Defi' 

f d 1 1 

(7.1) g{x) = max ( — h -I^x), 9 + I^{x 



ne 



^ d d 

where Ic{x), I^{x) are as in ^5.S\) . Define 

X = Xjgj[min(cj, c^), max(cj, Cj)]. 
Then the minimum of g{x) on X is attained when and only when 



if Cj =0 or c'. = 0, 



(7.2) 

where a is the solution in [0, 1] to 

(7.3) E c,«(c;.)^-" f ^ 

if there is a solution in [0, 1] (in which case it is unique). If ([7.3^ has no 
solution in [0,1], then a is either or 1, depending on which of the two 
resulting choices of x (as per |7.^ j gives the smaller value of g{x). If the 
sum in ([7.3[j has no terms, then x is the zero vector. 




When (17. 3p has no solutions in [0, 1], the minimal value of g{x) is easy 
to describe: as we shall see, it equals 



(7.4) max ( 1 - ^ ^ Cj, 6* + 1 - > c' 



where Jo = { j G J : Cj , / 0} . 

No matter whether (j7.3p has a solution in [0, 1] or not, the minimal 
value of g{x) will be greater than 9 if and only if + ^/^(cf) > 9. This 
is easy to see: if + ^/^'(c') < 9, then g{(^) < 9, and so ming(x) < 9] if 
^ + \ls{(^) > 9, we have ^ + ^/?(x) > for x in a neighbourhood of 
c', and 9 + Iff{x) > 9 outside the neighbourhood. 



Proof of Lemma \7.1\ For every j £ J such that cj = or c^- = 0, the 
variable xj is forced to be zero for all x G X such that g(x) < oo. At the 
same time, if Xj = 0, the terms involving Xj make no contributiorH to either 



^We set the convention OlogO = when Ig{x) was defined. See the comment after 1 15.21 1. 
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Ic{x) = 1 - Ej + log T- ^s' {x) = l- Ej + Ej log ^ . (See 

the convention on OlogO chosen after (j5.2p .) Hence, we may redefine J to 
be {j G J : Cj,c'j ^ 0}, and thus reduce the problem to the case in which 
Cj,c'j / for ah j £ J. We assume, then, that Cj,c'j ^ for all j £ J; 
consequently, /^(x) and I^{x) will be smooth on (M^)'^, which is an open 
superset of X. 

Define gi{x) = + ^/^(x), g2{x) = 9 + I^{x). Then g{x) equals 
max(gi{x), g2{x)). If g(x) is minimal on X at x £ X, then it is minimal 
on (R"*")"^ at x: the partial derivatives -^gi{x), -^g2{x) are negative for 
Xj < min(cj,c^) and positive for Xj > max{cj , c'j) . The minimum of g on 
(R"'')"^ may be attained at a point x where 

(a) gi{x) has a local minimum, 

(b) g2ix) has a local minimum, or 

(c) gi{x) = g2{x). 

(There are no other cases: if none of the above were to hold, a small dis- 
placement in X will decrease whichever one of gi{x) or g2{x) is greater, and 
thereby decrease g{x) = max{gi{x), g2{x)) from its supposed minimum.) 
The only local minimum of gi{x) on (M"^)'' is at x = c, and the only lo- 
cal maximum of g2ix) on (M+)'^ is at x = c?. It remains to consider case 
(jcj). Then g reaches a minimum on (R"'')'^ at a point x on the surface S 
described by the equation gi{x) = g2{x). By restriction, g reaches a min- 
imum on 5 n (M"*")"^ at x. Now, on S" n (M^)*^, the function gi{x) equals 
g{x). It follows that Vgi{x) is perpendicular to 5, and thus Vgi{x) is a 
scalar multiple of V{gi{x) — 172 (x)). In other words, one of V(7i(x), Vg2{x) 
is a scalar multiple of the other. Now 

Vg2{S) = V{e + I^{x)) = Vl^ix) = I log ^ I . 

We conclude that Xj = c"(c^)"'^~" for some a. Since x £ X, we know that 
a must be in [0, 1]. As we have already seen, Xj = c"(c^)^~" holds in cases 
((aj) and ^ just as well, with a = 1 and a = 0, respectively. 
Our task is now to find the minimum of 

<7(K(c;.)^-"}) = max(5i({c^(c^)i-"}), <72({c^(c^)^-"})) 

for a £ [0,1]. The map hi : a i—^ gii{c'j {c'j)^~"}) is increasing, whereas 
/i2 : a ^ 52({Cj (cj)^~"}) is decreasing. Thus, 

(7.5) 5i({c,"(c;)^-"}) = <72({c,"(4)^-"}) 
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for at most one a £ [0,1], and, if such an a exists, fl'({c°(c^)^ "}) attains 
its minimum thereat. Writing out g\ and 52 ! we see that (j7.5p is equivalent 
to (E3D. 



If (j7.5p has no solution a within [0, 1], the minimum of 5({Cj (c^)^~"}) 
is min(max(/ii(0), /i2(0)), max(/ii(l), /i2(l))), which, since h\ is increasing 
and /i2 is decreasing, equals max(/ii(0), /i2(l)), which, written in full, is 



max 




'J 



This is nothing other than (j7.4p . □ 

Thanks to Prop. 16. 1^ Lem. 16.21 and Lem. 17.11 we finally know how to 
bound the number of elements n G 5 (5 a tight set) such that ]^\f{ji) for 
some large prime p. Our end is to estimate the number of elements n £ S 
such that /(n) is free of A;th powers. The remaining way to the end is rather 
short. 

Lemma 7.2. Let f £ Z[x] be a polynomial. Let k > 2. Let S be a 

predictable, {P,9)-tight set. Then the number of elements n £ Sn [1,A'"] 
such that f{n) is free of kth powers equals 

fi{d)\{n G 5n [l,iV] : d'^ I /(n)} I +0/(Z)2 + Z)-('=-i) (log L>)^iV) 

(7.6) d<D 

+ 0{\{n £ Sn[l,N]:3p> s.t. /|/(n)}|) 

for every D > 2 and some c > depending only on deg(/). The second 
implied constant is absolute. 

Cf. [22], Prop. 3.4. 

Proof. Apply the riddle in [22j, §3 (that is, [22], Prop. 3.2) with ^ equal 
to the set of all primes, =2/ = S'n[l, A^], r(a) = {p prime : p^\a}, f{a,d) = 1 
if d = 0, and /(a, d) = for d non-empty. Use the bound 

\{n €Sn[l,N]: d^|/(n)}| < {1 < n < iV : d'\fin)} «^ ^i^^^J^^'^ + 1. 

□ 

Now it only remains to reap the fruits of our labour. In order to avoid 
unnecessarily lengthy and complicated statements, we will give the explicit 
results below for irreducible polynomials / alone. They can be easily re- 
stated for general polynomials in the manner of the statement of the main 
theorem. 
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Proposition 7.3. Let f G Z[a;] be a polynomial. Let k > max(2, deg(/) — 

1). //fc = deg(/)-l, then 

(7.7) 

|{1 < n < AT : d'^lfin) ^ d = 1}| = iV • J] (^1 - 



+ Of,k,e N{logN) 



(T(Galj-) 
deg{/)|Gal^ 



for every e > 0, where Pf{p^) is the number of solutions to f{x) = modp*^ 
in Wj/p'^'L and (T(Gal/) is the number of maps in Gal/ that have fixed points. 
If k > deg(/), then 

\{l<n<N: d^\f{n) ^ d = l]\ = N - 
(7.8) P V J 



for some 5 depending only on deg(/). 



Remark. The value of 6 in the right side of (j7.8p may be chosen to be 
1 — '^^^^'^'^^^f^^ — g'^ with e' > arbitrarily small: sharpen Lem. 16.21 as 
detailed in the remark after its proof, and then apply Lemmas 16.21 and 17.21 
with D = ivV(fc+i)_ 

Proof. Apply Lemma 17.21 with D = N^' , where 6' > will be chosen later. 
The error term |{n G S'n [1, iV] : 3p > D"^ s.t. p''\f{n)}\ in (I7.6p is at most 
N times (16. 2p . and thus can be bounded by Prop. [6?Tl The exponent 7 in the 
bound on (16. 2p in Prop. [6TT] can be determined by Lemma l7.lt it amounts 

to 1 — deg{^)\Galf I • ^^^^ with the main term ^^<;^5' p{d) • |{1 < n < 

N : d^\f{n)}\; we wish to show that it equals N ■ flp (^1 — ^^^f plus a 
small error term. 

Since \{1 < n < N : d^\n}\ = N/d^ + 0(1), we have 

Kd) ■\{l<n<N: d'=|/(n)}| = n{d)pf{d') + 0(1)) , 

d<N^' d<N^' 

where Pf{m) = Y\p\mPf {p^^^^^)- The right side equals 
(7.9) 



+ E Hd)Pf{d'')\+N 



d'^ 

ydKN^' d>N^' 
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By \fi{d)pf{d'')\ < np|dlP/(/)l </ (deg/)'^W, we have that (USD equals 
• n, (l - ^) plus 



plus the error 

term 0(iV2<5' _^^-(fc~i)5'(iQg^)c^) 

coming from Lemma [7.2i 

We set = ^ and are done. □ 

Proposition 7.4. Let f € Z[x] be a polynomial. Let k > max(2,deg(/) — 

1). //fc = deg(/)-l, then 

(7.10) 

\{q prime, q < N : d^\f{q) ^ d = l]\ = 7r{N) • J] " p/'l^pk-i ) 

+ ^(iV)-0/,,,,((logiV)-^+^) 

for every e > 0, where 7:{N) is the number of primes from 1 to N, Pf^^^ip^) is 
the number of solutions to f{x) = modp*^ in (Z/p^Z)*, and 7 = g{x) — 1, 
where g is as in |7. j[ j and x is as in ((7.^[ ), with c and as in Prop. \6.1[ 
We have 7 > «/ and only if If > 1. 

If k > deg(/), then, for every A> 0, 
(7.11) 

\{q prime, q < N : d'^\f{q) ^ d = l]\ = ^(iV) - ^^r^) 

p V P P J 

+ Of,kA {N{\ogN)-^). 

Proof. Proceed as in the proof of Prop. [7131 with D = (log A^)"^; use Siegel- 
Walfisz to estimate \{q prime, g < N : d^\f{q)}\ioi d< D. □ 

Remark. Weaker effective results can be used instead of Siegel-Walfisz; if 
the best such results are used (see, e.g., [9], §14, (9), and §20, (11)) then, 
as can be shown by a simple computation, the error term in (jT.lOp remains 
unaltered for deg(/) < 6. 

Remark. See Table [2] for the values of 7 for deg(/) < 6. If deg(/) > 
6, Nair's result ([36], Thm. 3) applies; its error term is no larger than 
OA{N{\ogN)~'^), where A > is arbitrarily large. 

Remark. Let / be irreducible and normal] that is, assume its degree d 
equals the degree |Gal/| of its splitting field. As seen in (|6.13p . we have 
// > 1, and so 7 > in (jT.lOp : in other words, the error term is smaller 
than the main term. Because the structure of Galj is particularly simple, 
we shall be able to give a fairly uncomplicated expression for 7. 

For X € (— e~-^,0), let W-i{x) be the smaller of the two solutions y 
to ye^ = X. (As can be gathered from the notation, W-i is one of the 
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branches of the Lambert W function.) Let c, S be as in Prop. \67\] By 
the Cauchy-Frobenius formula and the fact that / is normal, the only map 
in Gal/ with any fixed points is the identity. Thus c'^^^ = for g ^ e and 

c'^g^ = 1, while c^e) = 3' Therefore, (j7.3p can be rewritten as 
/d-1 , , , /I d-1 

(We have 9 = 1 because we are working on the primes.) We let y = 
-log(d)a - + I^y) and rewrite (f7T^ as 

_Md-2)/id-l) 

Since d-l> for d > 3 (as is our case), the right side of ([713]) 

is in the range (— e~"^,0), and thus y = W-i ^^^^^^rjy— • Hence 

Now ()7.2p gives x^g) = c?~", x^^^ = for (7 7^ e and (jT.ip yields 

7 = 5(^) - 1 = ^c'(^) = 1 - ^ ^ 
d log d 1 



Thus, for d large, 7 ~ -^^fp- — ^jtT) which goes to as d ^ 00. 

Corollary 7.5 (to Prop. 17. 4p . iei / G 6e a cu6ic polynomial irre- 
ducible overQ. Suppose that its discriminant is a square. Then the number 
of primes q from 1 to N such that f(q) is square-free equals 

-(^) • n (1 - + oM^) ■ (iogiv)-^+^) 

for every e > 0, where 

(a) 7r(A^) = A^/ log + 0(A^/(log A^)^) is the number of primes up to N, 

(b) Pf^^:{p^) is the number of solutions to f{x) = modp^ in {'L/p^'L)*, 

(c) 7 equals 1 — 3~" + 3~"log3~" > 0, where a is the only solution in 
[0,1] to 3-(|-(logl)-(l + |a)) = l. 

Numerically, 7 = 0.003567 .... 

Proof. Since the discriminant of / is a square, the Galois group Gal/ of / 
is ^3. Apply Prop. [731 □ 
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Gal/ 7 


Gal/ 7 


Gal/ 7 


^3 0.0035671 






C(4) 0.0265166 


^(4) 0.0265166 


i?(4) 0.0006060 


C(5) 0.0417891 






C(6) 0.0505865 
^4(6) 0.0104233 
Fi8(6) : 2 0.0000529 

Table 2. Values of 


1)6(6) 0.0505865 
Fis{6) 0.0170657 
F36(6) 0.0000529 

7 for 9 = 1 and / of 


L>(6) 0.0104233 
2^4(6) 0.0157592 
254(6) 0.0000059 

degree d < 6 with 



If > 1. The number of primes p < N with f{p) free of 
(d — l)th powers equals a constant c/ times 7r(Ar)(l + 
0((logiV)-^+^)). 



Gal/ 


7 


Gal/ 


7 


Gal/ 


7 


^3 


0.3888889 


53 


0.2777778 




C(4) 


0.4375000 


E(4) 


0.4375000 


^(4) 


0.4062500 




0.3125000 


($'4 


0.3437500 






C(5) 


0.4600639 


D(5) 


0.3800000 


F(5) 


0.3400000 


^5 


0.3800000 


55 


0.3733333 






C(6) 


0.4728484 


I?6(6) 


0.4728484 


D(6) 


0.4444444 


^4(6) 


0.4444444 


i^l8(6) 


0.4537037 


2^4(6) 


0.4513889 


£•4(6^) 


0.4305556 


5*4(60) 


0.4305556 


Fi8(6) : 2 


0.4351852 


i^36(6) 


0.4351852 


254(6) 


0.4340278 


L(6) 


0.3888889 


i^36(6) : 2 


0.4259259 


L(6) : 2 


0.4027778 


^6 


0.3935185 


56 


0.3946759 











Table 3. Values of 7 for = ^ and / of degree d < 6. 
The number of sums of two squares q < N with f{q) free 
of (d — l)th powers equals a constant c/ times tu(A^)(1 + 
0((log A'')"'''"'"'^)), where w{N) is the sum of integers up to 
N that can be written as sums of two squares. 



Proposition 7.6. Let / G Z[x] be a polynomial. Let k > max(2,deg(/) 
1). Let S be the set of all integers that are the sum of two squares, 
k = deg(/) — 1, then 
(7.14) 

\{neSn [1, N] : d'^lfin) ^ d = 1}| = w{N) ■ H^l - p/,o(/)) 

p 

+ Of,k,e M7V)-(log7V)-7+^), 
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for every e > 0, where 

/ \ -1/2 

N 



w{N) 



p 



-2\ 



|5n[i,iV]|~ |2. H (1 

p=3 mod 4 

Po(a,/), 

/(a)=0 modp* 

+ p~^) if p = 3 mod 4, Vp{a) even, Vp{a) < k, 

if p = 3 mod 4, Vp{a) odd, Vp{a) < k, 



Po{a,p^ 



if p = 3 mod 4, Vp{a) even, Vp{a) = k, 
if p = 3 mod 4, Vp{a) odd, Vp{a) = k, 
otherwise, 



and 7 = g{x) — ^, where g is as in |7. j[ j and x is as in ( 7.2), with c and cf 
as in Prop. \ 6.1[ 

Ifk> deg(/), then, for all A> 0, 



(7.15) 



\{neSn [1, N] : d'^lfin) ^ d = 1}\ = w{N) • - pf^p'')) 



+ Of,k,A (iV(logA^)-^) 



where w{N) and pj o are as above. 



Proof. Proceed as in the proof of Prop. [731 Use [42j, Hilfsatze 10 and 12, 
to show 

(7.16) |5n [1,A^] n (a + mZ)| = po{a,m) ■ |5n [l,iV]| +OA{Ne-''^^) 

for a, m with gcd(a, 2m) = 1, where po{a,m) = Wp\mP°i^^P'"^^'^'^) ^"^^ ^ 
a positive constant. Extend (|7.16p to the case gcd(a, 2m) 7^ 1 by direct use 
of the fact that n G 5 if and only if Vp{n) is even for every p = 3 mod 4. □ 

Remark. Since + ^/?(x) > > | for all x, we have 7 > 6* for 
9 = 0.5 and / arbitrary. Hence, the error term in (j7.14p is smaller than 
0(A^(log A^)~i/^+'^), e > arbitrary, and thus it is smaller than the main 
term for all / for which the infinite product in ()7.14p does not vanish. 

The values of 7 for deg(/) < 6 are in Table El Most entries in the table 
are rational; this is because, when d = ^ and c, c' are as in Prop. 16.11 the 
minimum of (|7.ip is reached at x = c' for many (but not all) /. 

* * * 

The approach taken in this paper can be applied to improve upon the 
error term given in |22J for the estimated number of pairs of integers 
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G such that f{x,y) is square-free, where / is a sextic ho- 

mogeneous polynomial. Asymptotics were first given in [16], with the 
error term 0(A^^(log A^)~^/'^); soon thereafter, K. Ramsay [H] attained 
0(A^^(log A^)~^/'^) by means of a slight modification in the argument. The 
error term in [22j depends on the Galois group of /(x, 1); for / generic, it 
is 0(A^^(log A^)"''''^"'*^'"). We can do better now for every /. 

Proposition 7.7. Let f € be a homogeneous polynomial of degree 

6 irreducible in Q[x,y]. Then the number of pairs of integers {x,y), 1 < 
x,y < N, such that f{x,y) is square-free equals 



(7.17) n (l - ^) + Of. (iV^(log iV)- 



where G is the Galois group of the splitting field of fo{x) = f{x, 1), a{G) 
is the number of maps g in G that have fixed points, Pf{p^) is the number 
of solutions {x,y) £ (Z/p^Z) x (Z/p^Z) to f{x,y) = modp^, and e > is 
arbitrary. The implied constant depends only on f and e. 

Proof (Sketch). Proceed as in Prop. [731 replacing Lem. [712] bv [22j . Prop. 
3.5. It remains to bound 

(7.18) \{l <x,y < N :3p> N"^ s.t. p'^\fix,y)}\. 

This we do by giving a bound for the number of rational points on Gr '■ 
ry"^ = g{x) with 1 < r < A^^, where g{x) = /(x,l). We can do this by 
finding for the great majority of r (as in Prop. l^^ a divisor tQ\r, t^ > A^^""^, 
with few prime factors, and then using it as in [23j, §5. (As before, a divisor 
tQ\r of the right size will exist for all r outside a small set, viz., a set of 
cardinality ^ A^^(log A^)~^, A arbitrary.) The divisor to is large enough 
to increase the angle given by Mumford's gap principle to 7r/2 — 0(e). We 
can then apply sphere-packing results (Lem. 14. ip . bounding the rank of 
Gr in terms of w{r) as in [22], Prop. 4.22 (that is, using [S], though we 
may use the more general statements in [39j instead). Let D be a positive 
integer that will be set later. We consider all r < -D such that (a) r has 
a divisor as above, and (b) the Frobenius element in G = Gal^ of every 

p\r has fixed points. There are <C/ -D(log A^) I'^l such integers r. Since 
the bound on the number of rational points per r coming from sphere- 
packing is (log A^)*^ for all r outside a small set, we obtain a total bound of 

< / Dilog N) \G\ +^ Since rp^ = f{x) with r>D implies p < N^/VD, 
we can bound the contribution to (|7.18|) of solutions to rp'^ = f{x) with 

r > Dhy Of i^-^-JL_^y Set D = {logN) ^igi n'^, We conclude that 

1 I g-(G) I 

dZH]) is at most 0/(Ar2(logAf) ^+W+'). □ 
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Remark. Pairs of integers {x,y) G n [1, A^] are numerous enough that 
considerations of entropy are not needed to prove Prop. 17.71 and, in fact, 
would not help. Thus, the situation is similar to that in Prop. 17.31 and the 
contrary of the situation in every other result in this paper. 
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